SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: DUAN, RDXANKE 
RUBEN, STEVEN 

(ii) TITLE OF INVENTION: Parotid Secretory Protein 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HUMAN GENOME SCIENCES, INC. 
' (B) STREET: 9410 KEY WEST AVENUE- 
CO CITY: ROCKVILLE 
• (D) STATE: MD' 

(E) COUNTRY: US 

(F) ZIP: 20850 

(V) COMPUTER READABLE EDRM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible / 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D> SOFTWARE: PatentIn Release #1 . 0, . Version #1.30 

(vi) .CURRENT APPLICATION- DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: BROOKES, ANDERS A. 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE/ COCKET NUMBER: PF34-8 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 301-8439 - 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1028 base pairs 

(B) TYPE: nucleic acid 

(C) StRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii). MOLECULE TYPE*": DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 49. .795 

(ix) FEATURE: 

^ (A) NAME/KEY: sig_peptide 

(B) LOCATION: 4 9.. 100 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 103.. 795 



(xi)-. SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CACGAGATTT CATGAGCATC CTCCTCTAAA CGCGTGTCAA GACAAAAG ATG CTT CAG 

Met Leu Gin 
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CTT TGG AAA CTT GTT CTC CTG TGC GGC GTG CTC ACT GGG ACC TCA GAG 
Leu Trp Lys Leu Vai Leu Leu Cys Gly Val Leu Thr Gly Thr Ser Glu 
-15 -10 -5 1 

TCT CTT CTT GAC AAT CTT GGC AAT GAC CTA AGO AAT GTC GTG GAT AAG 
Ser Leu Leu Asp Asn Leu Gly Asn Asp Leu Ser Asn Val Val Asp Lys 
5 10 15 

CTG GAA CCT GTT CTT CAC GAG GGA CTT GAG ACA GTT GAC AAT ACT CTT 
Leu Glu Pro Val Leu His Glu Gly Leu Glu Thr Val Asp Asn Thr Leu 
20- 25 .30 

AAA GGC ATC CTT GAG AAA CTG AAG GTC GAC CTA GGA GTG CTT CAG AAA 
Lys Gly lie Leu Glu Lys Leu Lys Val Asp Leu Gly Val Leu Gin Lys 
35 40 45 .' 

TCC AGT GCT TGG CAA CTG GCC AAG CAG AAG GCC CAG GAA GCT GAG AAA 
Ser Ser Ala Trp Gin Leu Ala Lys Gin Lys Ala Gin Glu Ala Glu Lys 
50 55 • 60 65 

TTG CTG AAC AAT GTG ATT TCT AAG CTG CTT CCA ACT AAC ACG GAC ATT 
Leu Leu Asn Asn Val I-le Ser Lys* Leu Leu Pro Thr Asn Thr Asp lie 
70 75 80 

TTT GGG- TTG AAA ATC AGC AAC TCC CTC ATC CTG GAT GTC AAA GCT GAA 
Phe Gly Leu Lys lie Ser Asn Ser Leu lie Leu Asp Val Lys Ala Glu 
85 90 95 

CCG ATC GAT GAT GGC AAA GGC CTT AAC CTG AGC TTC CCT GTC .ACC GCG 
Pro lie Asp Asp Gly Lys Gly Leu Asn Leu Ser Phe Pro Val Thr Ala 

100 105 110 

AAT GTC ACT GTG GCC GGG CCC ATC ATT GGC CAG ATT ATC AAC CTG AAA 
Asn Val Thr Val Ala Gly ' Pro lie lie Gly Gin lie He Asn Leu Lys 
115 120- 125 

GCC TCC TTG GAC CTC CTG ACC GCA GTC ACA ATT GAA ACT GAT CCC CAG 
Ala Ser Leu Asp Leu Leu Thr Ala . Val- Thr He Glu Thr Asp Pro Gin 
130 135 140 - 145 

AGA CAC CAG CCT GTT" GCC GTC CTG GGA GAA TGC GCC AGT GAC CCA ACC 
Thr His Gin Pro Val Ala Val Leu Gly Glu Cys Ala Ser Asp Pro Thr 
150 155 160 

AGC ATC TCA CTT TCC TTG CTG GAC AAA CAC AGC CAA ATC • ATC AAC AAG 
Ser He Ser Leu Ser Leu Leu Asjp Lys His Ser Gin He He Asn Lys 
165" 170 175 

TTC GTG AAT AGC GTG ATC AAC ACG CTG AAA AGC ACT GTA TCC TCC CTG 
Phe Val Asn Ser Val He Asn Thr Leu Lys Ser thr Val Ser Ser Leu 
180 ' ' - 185 190 

CTG CAG AAG GAG ATA TGT CCA CTG ATC CGC ATC TTC ATC CAC TCC CTG 
Leu Gin Lys Glu He Cys Pro Leu He Arg He Phe He His Ser Leu 
195 200 . -205 

GAT GTG AAT GTC ATT CAG CAG GTC GTC GAT AAT CCT CAG CAC AAA ACC 
Asp Val Asn Val He Gin Gin Val Val Asp Asn Pro Gin His Lys Thr 
2i0 . 215 220 ' 225 

CAG CTG CAA ACC CTC ATT TGAAGAGGAC GAATGAGGAG GACCACTGTG 
Gin Leu Gin. Thr Leu He 

230 - . . - 

GTGCATGCTG ATTGGTTCCC AGTGGCTTGC CCCACCCCCT TATAGCATCT CCCTCCAGGA 



AGCTGCTGCC ACCACCTAAC CAGCGTGAAA GCCTGAGTCC CACCAGAAGG ACCTTCCCAG 



r 



ATACCCCTTC TCCTCACAGT CAGAAQi-GCA GCCTCTACAC ATGTTGTCCT GCCCCTGGCA 
ATAAAGGCCC ATTTCTGCAA AAA 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 9 amino, acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Leu Gin Leu Trp Lys Leu Val Leu. Leu Cys Gly Val Leu Thr Gly . 
^18 -15 -10 -5 

Thr Ser Glii Ser Leu Leu Asp Asn Leu"' Gly Asn Asp Leu Ser Asn Val 

.1 ^ . 5 10 

Val Asp Lys Leu Giu Pro Val. Leu His Glu Gly Leu Glu Thr Val Asp 

15 '20 25 30 

Asn Thr Leii Lys Gly lie Leu^Glu Lys Leu Lys Val Asp Leu Gly Val 
35 40 45 

Leu. Gin Lys Ser Ser Ala Trp Gin Leu Ala Lys Gin Lys Ala Gin Glu 
50 ' - 55 60 . 

■Ala Glu Lys L^u Leu Asn Asn Val lie Ser Lys Leu Leu Pro Thr Asn 

.65 .70 75 

Thr Asp lie Phe Gly Leu Lys lie Ser Asn Ser Leu lie Leu Asp Val 
80 '85 .90 ^ - 

Lys Ala Gly Pro lie Asp Asp Gly Lys Gly Leu Asn Leu Ser Phe Pro 
95 ' 100 105 110 

Val Thr Ala Asn Val Thr Val Ala Gly Pro He He Gly Gin He He 
115 - 120 . 125, 

Asn Leu Lys Ala Ser Leu Asp Leu Leu Thr Ala Val Thr He Glu fhr 
130 135 140 

Asp Pro Gin Thr His Gin Pro Val Ala Val Leu.. Gly Glu Cys Ala Ser 
145 / 150 155 

Asp Pro Thr Ser He Ser Leu Ser Leu Leu Asp Lys His Ser Gin He " 
160 165 - ' 170 

He Asn Lys Phe Val Asn Ser Val He Asn Thr Leu Lys Ser Thr Val 
175 ■ ' 180 • * 185 1'90 

Ser Ser Leu Leu Gin Lys Glu He Cys Pro Leu He Arg He Phe He 
195 200 205 

His Ser Leu Asp Val Asn Val He Gin Gin Val Val Asp Asn. Pro Gin 
210 ' 215 . * 220 

His. Lys Thr G^n Leji Gin Thr Leu He 
225 ^ 230 • 

(2) INEDRMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: . ' ' ■ ' 

(A) LENGTH: 235 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met. Phe Gin Leu Gly Ser Leu Val Val Leu Cys Gly Leu Leu lie Gly 
1 , 5 10 15 



Asn Ser Glu Ser Leu Leu Gly Giu Leu Gly Ser Ala Val Asn Asn Leu 
15 "20 25 ■ 30 . ' 

Lys lie Leu Asn Pro Pro Ser Glu Ala Val Pro Gin Asn- Leu Asn Leu 
35 ' 40 . . 45 

20 Asp Val Glu Leu Leu Gin Gin Ala Thr Ser Trp Pro Leu Ala Lys Asn 

50 55 - 60 



Ser lie Leu Glu Thr Leu Asn Thr Ala Asp Leu Gly Asn Leu Lys Ser 

65 70 . ■ 75. 80. 

Phe Thr Ser Leu Asn Gly Leu Leu Leu Lys lie Asn Asn Leu Lys Val 

85 90 . 95 



Leu Asp Phe Gin Ala Lys Leu Ser Ser Asn Gly Asn Gly He Asp Leu 

30 100 105 110 

Thr Val Pro Leu Ala Gly Glu Ala Ser Leu Val Leu Pro Phe He Gly 
115 120 125 

-35 Lys Thr Val Asp .He Ser Val Ser Leu Asp Leu He Asn Ser Leu Ser 

130 . ' 135 140 



lie Lys Thr. Asn Ala Gin Thr Gly Leu Pro Glu Val Thr lie Gly Lys 

145 . . 150 155 160 

Cys Ser Ser Asn Thr Asp Lys lie Ser He- Ser Leu Leu Gly Arg. Arg 

■ 165 - 170 175 



Leu Pro He -He Asn Ser He Leu Asp Gly Val Ser Thr Leu Leu Thr 
45 180 - * 185 190 

Ser Thr .Leu Ser Thr Val Leu Gin Asn Phe Leu Cys Pro Leu Leu Gin 
195 200 205 

50 Tyr Val Leu Ser Thr Leu Asn Pro Ser Val Leu Gin Gly Leu Leu Ser 

210 215 '220 

Asn Leu Leu Ala Gly Gin Val Gin Leu Ala Leu . 
225 230 235 



(2) INFORMATION EDR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 235 amino acids 
60 (B) ■ TYPE :;■ amino acid 

"(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: .protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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Met Phe Gin Leu Gly Ser Leu Val Val Leu Cys Gly Leu Leu lie Gly 
15 10 15 

Thr Ser Glu Ser Leu Leu Gly Asp Val Ala Asn Ala Val Asn Asn Leu 
20 • 25 30 

Asp lie Leu Asn Ser Pro Ser Glu Ala Val Ala Gin Asn Leu Ash Leu 
35 '40 45 ' 

Asp Val Gly Ser Leu Gin Gin Ala Thr Thr Trp Pro Ser Ala Lys Asp 
50 55 ' 60 

Ser lie Leu Glu Thr Leu Asn Lys Val Glu Leu Gly Asn Ser Asn Gly 
65 ■ 70 75 80 

Phe Thr Pro Leu Asn Gly Leu Leu Leu Arg Val Asn Lys Phe Arg Val 
85 90 95 

Leu AsD Leu Gin Ala Gly Leu Ser Ser Asn Gly Lys Asp lie. Asp Leu 
100 . . 105 110 

Lys Leu Pro Leu Val Phe Glu lie, Ser Phe Ser Leu Pro Val lie Gly 
115 120 125 

Pro Thr Leu Asp Val Ala Val Ser Leu Asp Leu Leu Asn Ser Val Ser 

130 135 140 

Val Gin Thr Asn Ala Gin Thr Gly Leu Pro Gly Val Thr Leu Gly Lys 
145 . ' 150 ■ 155 160 

Cys Ser Gly Asn Thr Asp Lys lie Ser lie Ser Leu Leu Gly Arg Arg 
165 170 175 

Leu Pro Phe Val Asn Arg He Leu Asp .Gly Val Ser Gly Leu Leu Thr 
. '■ . 180 ' 185 ■ 190 

Gly Ala Val Ser lie Leu Leu Gin Asn He Leu Cys Pro Val Leu Gin 
195' 200 205 

Tyr Leu Leu Ser Thr Met Ser Gly Ser Ala He Gin Gly Leu Leu Ser 
210 . 215 220 

Asn. Val Leu Thr Gly Gin Leu Ala Val Pro Leu 
225 230 235 

(2) INEORMATION FOR SEQ ID N0:5: " ' • - ■ 

(i) . SEQUENCE CHARACTERISTICS: - * 

(A) LENGTH: 206 amino acids . 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear. 

* (ii) MOLECULE TYPE: protein - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met -phe Gin Leu Gly Ser Leu Val Val Leu Cys Gly Leu Leu lie Gly 

1 5 10. - . " 15 . 

Thr Ser Gly" Ser Leu Phe Asp He Phe Gin Asn Pro Glu Leu Asp Val 

•20 25 - 30 

Glu Ser Val Trp Ser Glu He Asn Tyr' Arg He Arg Tyr Ala Leu Glu 

35 40 • 45 



Thr Met Asp Leu Asp Met Leu Ala .Asp Tyr Leu Ser Lys Arg Gly lie 
.50 55 60 

Glu Leu Lys lie Lys Asp Leu Arg lie Leu Asn Leu Asn His Glu Val 
65 70 75 80 

Ser Pro Asn Gly Asp Glu Val Thr Leu Lys Met Pro Met Ala Leii Asn 
85 90 95 

Ala Ser Leu Ser Leu Pro Ala Arg Asp 'Leu Thr Thr Asp Val Ser lie 
100 ' 105 110 . 

Ser Met Glu Ala lie Thr Ser Phe Ala lie Glu Lys Asp Pro Lys Thr 
115 ' 120 - 125- 

Gly Arg Arg Val Leii Asn Met Gin Arg Cys Ser Leu Asn Thr Asp Asn 
130 135 140 

Thr Ser lie Ser Leu Leu Asn Arg Lys Ser Asn Phe Val Asn Leu Ala 
145 ■ . 150 • 155 " 160 

Leu Asp Ser Ala Leu Tyr Leu lie Lys Arg Gly Leu Thr Leu Pro Val 
165 170 175 

Arg Arg Gin Leu Cys Pro Val Leu Gin Leu lie lie. Ser Asn Thr Phe 

180 ^ 185 190 

His Pro Asp Glu lie Ser "Asn Pro Glii Thr Ala lie Ser Thr 
195 . . 200 . 205 . 

(2) INFORMATION FOR SEQ ID NO:6: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs • . * • 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single • ' - 
' (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTACAGCCAT GGAGTCTCTT CTTGACAATC TTGGCAATG 
(2) INFORMATION EOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) . STRANDEDNESS: 'single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: E^iA '(genomic) . 



(xi) SEQUENCE DESCRIPTION: .SEQ ID N0:7: 
CATCGCG(3AT CCAATGAGGG TTTGCAGCTG GGTTTT 
(2) INFORMATION EOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) ' TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID*N0:8: 

CTACGCGGAT CCGCCATCAT GCTTCAGCTT TGGAAACTTG TTC 

• (2) INFORMATION EOR SEQ ID NO: 9:, 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs - 
■(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE .TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO-: 9: 
CTCTGCTCTA GACTTAAATGA GGGTTTGCAG C 
(2) INHDRMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 b^se pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 'linear 

(ii) MOLECULE TYPE:. DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGCACGAGAT TTCATGAGCA TCCTCCTCTA AACGCGTGTC AAGACAAAAG ATGCHTCAGC 

TTTGGAAACT TGTTCTCCTG ' TGCGGCGTGC TCAGTG(3GAC CTCAGAC3TCT CTTCTTC3ACA 

ATCTTC^GCAA TGACCTAA(3C AATGTCGT(3G ATAAGCTGGA ACCTGTTCTT CACGAGGGAC 

TTGAGACAGT TGACAATACTT CTTAAAGGGC ATCCCCNTTT TNGAGAAACT GAAGGTCGAC 

. CTAGGAGTGC TTCAGAAATC CAGTGCTTGG CAACTGGCCA- AGCAGAAGGC CCAGGAAGCT 

GAGAAATTGC T(SAACCAATG TCATTTCTAA GCTTGCTTCCA ACTAACACGG ACATTTTTGG 
GGTGAAAAAT . ClAGCAACTCC CTCIATCCTGG ATGTCAiVAGC* TGAACCGATC" GATGATGGNA 

AAGGCTTAAA CTGIiGAGCTTC CCTGTCANC 

(2) INFORMATION FOR SEQ ID NO: 11: - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid • • 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATGAGCATCC TCCTCTAAAC GCGTGTCAAG ACAAAAGATG CTNNCAGCTT TGGAAACTTG 
TTCTCCTGTG CGGCGTGCTC ACTGGGACCT CAGAGTCTCT TCTTGACAAT CTTGGCAATG 
ACCTAAGCAA TGTCGTGGAT AAGCTGGAAC CTGTTCTTCA CGAGGGACTT GAGACAGTTG 
ACAATACTCT TAAAGGCATC CTTGAGAAAC TGAAGGTCGA CCTAGGAGTG CTTCAGAAAT 
CCAGTGCTTG GCAACTGGCC AACAGAAGGN CCAGGAAGCT GAGAAATTGC TGAACAATGT 
CATTTCTAAG CTGCTTCCAA CTAACACGGA CATTTTTGGG TTGAAANTCA GCAATNCCCN 
CANCCGGATG TTCAAAGNTG NANCGATCGA TGATGGGCAA AGGCTTTAAN CCGGAGGCTT 
CCCTGTTCAC CGGGAATGTT CAANGTNGGC CCGGGCCCNT CATTGGGCCA GNTTATCAAA 
NCTGGAAAGC TTCCTGGGAC CTCCGGACNG GNTCAACAAT TGAAANGATT CCCCGANA 
(2) . INEXDRMATION FDR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear - . 

(ii) MOLECULE TYPE: DNA (genomic). 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
GGCACGAGAT TTCATGAGCA TCCTCCTCTA AACGCGTGTC AAGACAAAAG ATGCTTCAGC 
TTTGGAAACT TGTTCTCCTG TGCGGCGTGC TCACTGGGAC CTCAGAGTCT CTTCTTGACA 
ATCTTGGCAA TGACCTAAGC AATGTCGTGG ATAAGCTGGG AACCTGTTCT TCACGAGGGA 
CTTGAGACAG TTGACAATAC TCTTAAAGGC ATCCTTGAGA AACTGAAGGT CGANCTAGGA 
GTGCTTCAGA AATCCAGTGC TTGGCAACTG GNCAAGCAGA AGGNCCCAGG AAGCTGAGAA 
ATTGCTGGAN CAATGTCAAT TCTAAGCTGN TTCCGACTAA CACGGNCATT TTTGGGTTG 
(2) information' EOR SEQ ID NO: 13: 

(i)- SEQUENCE CHARACTERISTICS: ' ^ ' 

(A) LENGTH: 374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:. 
ACCCACGCGT CCGGCGTGAT CAACACGCTG AAAAGCACtG TATCCTCCCT. GCTGCAGAAG 
GAGATATGTC CACTGATCCG CATCTTCATC CACTCCCTGG- ATGTGAATGT CATTCAGCAG 
GTCGTCGATA ATCCTCAGCA CAAAACCCAG CTGCAAACCC TCATCTGAAG AGGACGAATG 
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AGGAGGACCA CTGTGGTGCA TGCTGATTGG TTCCCAGTGG CTTGCCCCAC CCCCTTATAG 
CATCTCCCTC CAGGAAGCTG CTGCCACCAC CTAACCAGCG TGAAAGCCTG GAGTCCCACC 
AGAAGGACCT TCCCAGATAC CCCTTTTTCC TCACAGTCAG AGGNGGNNGC CTCTTACACN 
TGTTGTCCNG GCCC 

(2) INFORMATION EOR SEQ ID NO: 14: 

(i) SEQOENCE CHARACTERISTICS: 

(A) LENGTH:- 406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOEOIOGY: linear 

(ii) MOLECULE TYPE: .DNA (genomic) 

(xi) .SEQUENCE DESCRI'PTION : SEQ ID NO: 14: 
TTCATGAGCA TCCTCCTCTA AACGCGTGTC AAGACAAAAG ATGCTTCACT TTGGAAACTT 
GTTCTCCTGT'GCGGCGTGCT CACTGGGACC TCAGAGTCTC TTCTTGACAA TCTTGGCAAT 
GACCTAAGCA ATGTCGTGGA TAAGCTGGAA CCTGTTCTTC ACGAGGGACT TGAGACAGTT 
GACAATACTC TTAAAGGCAT CCTTGAGAAA CTGAAGGTCG ACCTAGGAGT GCTTCAGAAA 
TCCAGTGCTT GGCAACTGGG CCANCAGAAA GGCCCAGGGA AAGCGGAGAA ATTGCTGGAA 
CAATGTTCAT TTCTAAAGCT GCTTTCCAAC TAACACGGGA CNTTTTTGGG GTTTGNAAAA 
TCAGCCAACT TCCCTCAACC NNGGATGTTC CAAAGCTGGA AACCGN 
(2) INEORiyEATION EOR SEQ ID N0:15:- 

(i) SEQUENCE CHARACTERISTICS: 

(A) - LENGTH: 493 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ' - . ' 

(ii) NDLECULE TYPE: DNA (genomic) . - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCGACCCACG CGTCCGCCGC CCCCAACTTT TTGGCAGTTC TCCCATCTCT TGCCCACTTG 
ACCAGACTTT AATAGTTCCC TGTGTTTTCC AGACACAGCC AAATCATCAA CAAGTTCGTG ; 
AATAGCGTGA TCAACAGGCT GAAAAGCACT GTATCCTCCC TGCTGCAGAA GGAGATATGT 
CCACTGATCC GCATCTTCAT CCACTCCCTG GATGTGAATG'TCATTCAGCA GGTCGTCGAT 
AATCCTCAGC ACAAAACCCA GCTGCAAACC CTCATCTGAA GAGGACGAAT GAGGAGGAGC 
ACTGTGGTGC ATGCTGGTGA GGAGCCAGTC TCTGTGCCCC AATGCACAGG GGCCTATGGT 
GAAGTAAAAG TCAAGCGTGG CTTCCCTTAT TTTTGTGTTA. GAAGACTGTG CCTTCATCTC 
AGTCATAGAT TGAGCCCTGG NCCCCATCCC ANGCTAAGGC CTGATTCTGG TCANACTCTG . 
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AACACTGAGC CTT -4 93 

(2) INEORMATION EOR SEQ ID NO: 16: 

(i)- SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genoirdc) 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TCGACCCACA CGTCCGCCTG ACTGACAGAA GGAGATATGT CCATGATCCG ACATTCTTAC 60 

ATCCACTCCC TGGATGTGAA TGTTCATTNC AGCAGGTCGT NCGATAATCC NGCAGCACAA 120 

AACCCAGCTG GCAAACCCTN CATCTGAAGA NGACGAATGA GGANGACCAC TGTGGGTGCA 180- 

TGCTGATTGG TNTCCCAGTG GGCTTGCCCC AACCCCCTTA NAGCANCTCC CTCCAGGAAG .240 



t^j ' 25 CTGCTGCNAA CCAACCGAAC CAGCGTGAAA GCCTGAATNC CACCAGAAGG ACCTTCCCAG 300 

n\ ' ■ ' ' ■ • , • 

ATANCCCTGC TNCNCAACAG TNAAGAACAG CAGCTTCGAA CAACATGNGG TTCTGGCCCC 360 



CGGGCAATAA AAGGCCCATT TTGGCAAAAA AAAAA . 395 
(2) INEIORMATION FOR SEQ ID NO: 17: 



Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 base pairs 
35 (B) TYPE: nucleic^ acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 
GGCACGANAT TTCATGAGCA TCCTCCTCTA AACACGTGTC AAGACAAAAG ATGCTTCAGC 60 
45 TTTGGNAACT TGTTCTCCTA TNCNGCGTGC TCACTGNGAC CTCAGAATCT CTNCTT 116 

(2) INTORMATION FOR SEQ ID N0:-18: 

t. 

(i) SEQUENCE CHARACTERISTICS : 
50 . . - (A) LENGTH: 360, base pairs-. 

■ (B) TYPE: nucleic acid 

(C) STRANDEDNESS:' single ■ . . 

(D) TOPOLOGY: linear 

55 . (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GTCSGNCCGGG (3CCCATCATT (3GGCCAGANT TATCAACCTG AAAGCCTCCN NGGANCTCCT 60 

60 

GACCGCAGTC AACAATTGGA AACTGGATCC CCCAGAACAA CAACCAGCCT GGTTGNCCGT 120 

"NCTGGGGAAG AATGCCGNCC AATGAANCCC AAACCAAGCA NCTTCACTNN TNCCTN(3GCT 180. 

65 GGGGACCAAA ACACCAGGCC AAAATCCANT NAANCAAGTN TCCGTGGNAA TAAGCGTGGA 240 

ATCCIAAACAA CGCTGCSGAAA AAGCANTGGG NATNCCH^TCC CtGGCTGG(3C AAGAAAGGGN 300 

GATATGGTCC ACTGGAATCC C3GAATTTTTA ANCCAATTCC CTNGGAATGT G(3NAATGTCA 360 



